Detection and Diagnosis of Smoking Related Cancers

ABSTRACT

Gene probes for specific regions of chromosome 3 (3p21.3) and chromosome 10 (10q22) have been found to be tools for the diagnosis and prognosis of smoking related cancers such as non-small cell lung cancer (NSCLC). For example, these probes can be used with fluorescence in situ hybridization (FISH), and used to stratify smokers into high and low risk groups, as well as determine a patients susceptibility to the development of smoking related cancers.

The current application claims priority to provisional application 60/222,811 filed Aug. 4, 2000, herein incorporated by reference.

The government may owns rights to this invention pursuant to NCI, Dept. of Health and Human Services contract number N01-CN-85184.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to the fields of oncology, genetics and molecular biology. More particular the invention relates to the use of two probes for regions of human chromosomes 3 and 10 that are highly predictive of the development of neoplasia and progression of neoplastic events.

2. Related Art

Lung cancer is one of the leading causes of cancer death in the world. The high mortality rate for lung cancer probably results, at least in part, from the lack of standard clinical procedures for the diagnosis of the disease at early and more treatable stages compared to breast, prostate, and colon cancers. There is also extremely poor prognosis associated with diagnosis of the disease, especially in advanced disease. It is important that strategies to detect early stage lung carcinoma or its precursors, such as atypical squamous metaplasia, dysphasia and carcinoma-in-situ in subjects at high risk be devised.

Cigarette smoking over a prolonged period of time is the most important risk factor in the development of lung and other smoking related cancers, with other risk factors including exposure to passive smoking, certain industrial substances such as arsenic, some organic chemicals, radon and asbestosis, ingestion of alcohol, radiation exposure from occupational, medical and environmental sources, air pollution and tuberculosis. Many of these factors greatly increase the risk of development of lung and other smoking related cancers if they occur in a person who is concurrently a smoker.

Genetic detection of human disease states is a rapidly developing field (Taparowsky et al., 1982; Slamon et al., 1989; Sidransky et al., 1992; Mild et al., 1994; Dong et al., 1995; Morahan et al., 1996; Lifton, 1996; Barinaga, 1996). However, some problems exist with this approach. A number of known genetic lesions merely predispose to development of specific disease states. Individuals carrying the genetic lesion may not develop the disease state, while other individuals may develop the disease state without possessing a particular genetic lesion. In human cancers, genetic defects may potentially occur in a large number of known tumor suppresser genes and proto-oncogenes.

The genetic detection of cancer has a long history. One of the earliest genetic lesions shown to predispose to cancer was transforming point mutations in the ras oncogenes (Taparowsky et al., 1982). Deletion and mutation of p53 has been observed in bladder cancer (Sidransky et al., 1991). Numerous studies have shown deletions in the 3p region are related to lung and other smoking related cancers (Mitsudomi et al., 1996, Shiseki et al., 1996, Wistuba et al., 2000, Wu et al., 1998, and Shriver et al., 1998).

Molecular studies (fluorescence in situ hybridization (FISH) for polysomies, PCR for hypervariable markers (MI) and LOH, or specific mutations) have demonstrated that morphologically normal areas of bronchial epithelium closest to the carcinomas frequently show the most molecular abnormalities (3p, 17p, 9p, 5q). In particular, the short arm of chromosome 3 has been shown to frequently harbor deletions of alleles in several regions including 3p25-26, 3p21.3-22, 3p14 and 3p12. These regions are presumed to be the site of tumor suppressor genes, and loss of chromosome 3p allelles have shown to be an early event in lung tumorigenesis.

Chromosomal alterations in several cancers have been investigated, and frequent LOH at chromosome 10 has been reported in a variety of cancers, including glioma, glioblastoma multiforme, prostate cancer, endometrial cancer, chondrosarcome, bladder cancer, malignant melanoma, and follicular thyroid tumors ((Licciardello et al., 1989; Auerbach et al., N. Engl J. Med., 265: 253-267, 1961; Voravud, et al., 1993; Feder et al., 1998; Yanagisawa et al., 1996; Thiberville et al., 1995; Papadimitrakopoulou et al., 1996; Zou et al., 1998; Brugal et al., 1984; Dalquen et al., 1997; Muguerza et al., 1997).

Deletion rates of chromosome 3p are known to correlate with lung cancer. However, there is no current clinical method for the identifying a population of individuals who are at a high risk to develop lung cancers or upper airway primary or secondary cancers. A technique for determining the risks of developing these cancers would be of great value for the ability to limit exposure to additional environmental risk factors and to know when additional tests, supplements, or treatments are appropriate.

In various studies, chromosome deletions have been studied as identifiers for lung cancers. For example, Shiseki et al., (1996) analyzed 85 loci on all 22 autosomal chromosomes to determine that the incidence of LOH on chromosome arms 2q, 9p, 18q, and 22 q in brain metastases were significantly higher than that in stages I primary lung tumors. Mitsudomi et al. (1996) used PCR-based analysis for the detection of LOH in non-small cell lung cancer. Multiple regions on chromosome 3p were observed to show that deletions of the 3p chromosome may help to identify non-small cell lung cancer patients with a poor prognosis. Wistuba et al. (2000) used fifty-four polymorphic markers used to study the entire chromosome arm 3p and concluded that 3p allele loss is nearly universal in lung cancer pathogenesis. Wu et al. (1998) studied 3p21.3 deletion using the probe, D3S4604/luca. Peripheral blood lymphocytes of 40 lung cancer patients were observed to give the conclusion that lung cancer patients exposed to benzo[α]pyrene, a common byproduct of tobacco smoke, have frequent deletions in peripheral blood lymphocytes. Shriver et al. (1998) studied lung cancer cell lines and identified the human homolog of the L14 ribosomal protein gene, RPL14; deletion of RPL14 was shown to be related to the development of lung cancer. None of theses studies, however, are able to predict the susceptibility of a patient to the development of lung cancer or to predict whether smokers and non smokers are at a high risk of developing lung or other smoking related cancers.

Because of the grim prognosis of lung cancer with a ten year survival rate of <5% the only curable cancers are those diagnosed in the early stages and treated surgically. There is a shift of interest towards diagnosis and study of early and preneoplastic states. Because early detection and effective chemoprevention therapy have potential to be curative, it is imperative to stratify the patients in clinical trials. These patients need to be monitored fore results of chemoprevention therapy and also for predictions whether a particular preneoplastic lesion may progress.

SUMMARY OF THE INVENTION

The present invention provides probes located on chromosomes 3p21.3 and 10q22 useful in the diagnosis and prognosis of cancers related to smoking. In one embodiment, a method for identifying a subject at high risk for the development, recurrence, or metastasis of cancer comprising the steps of (a) obtaining a test sample from a subject; (b) providing a nucleic acid probe targeting RPL14, CD39L3, PMGM, or GC20; (c) contacting the probe with the test sample; and (d) analyzing DNA from the sample whereby aberrations in the hybridization of said probe to said DNA was compared to wild type DNA, indicating the risk for the development, recurrence, or metastasis of cancers.

More specifically the method identifies the risk for the development of cancers. The cancer may be lung, upper airway primary or secondary, head or neck, bladder, kidneys, pancreas, mouth, throat, pharynx, larynx, esophagus, brain, liver, spleen, kidney, lymph node, small intestine, pancreas, blood cells, colon, stomach, breast, endometrium, prostate, testicle, ovary, skin, bone marrow and blood cancer. In preferred embodiments, the cancer is lung cancer. The test sample can include, but is not limited to, a surgical or biopsy specimen, paraffin embedded tissue, frozen tissue, surgical fine needle aspirations, bronchial brushes, bronchial washes, bronchial lavages, buccal smears, sputa, peripheral blood lymphocytes, esophageal brush, a fine needle aspiration, urinary specimens such as bladder washings and voided urine, and esophageal washes.

In one embodiment, it is provided that the subject can come from a group comprising smokers, former smokers, or non-smokers. In a similar embodiment, the test sample comes from said subject who has not previously been diagnosed with cancer.

It is a further embodiment of this invention that additional testing, agents or treatments may be performed after the risk for the development of said cancers has been analyzed. This includes, but is not limited to, a spiral CT-scan, cancer therapies and pharmaceutical treatments which can include radiotherapeutic agents, surgical treatment for removal of the cancerous growth, chemotherapeutic agents, antibiotics, alkylating agents and antioxidants, biological modifying respidase drugs and other agents. These agents and treatments can be used alone or in combination with other agents.

In certain embodiments, it is contemplated that FISH is used to measure the aberrations in the particular loci. A unique 3p21.3 probe can be from 1000 to 2000 base pairs or larger and used for detection in a region of about 180,000 base pairs. The probe can be labeled with a fluorophore, or more specifically digoxigenin. A specific 10q22 probe can be used in conjunction with the 3p21 probe. In certain embodiments, a control probe is used which can be labeled with a fluorophore, or more specifically spectrum orange. The control probe is a chromosome 3 stable marker or more specifically Centromere 3 (CEP 3).

In another embodiment, there is provided a method for identifying a subject at high risk for the development, recurrence, or metastasis of cancer comprising: (a) obtaining a lung test sample from a subject; (b) providing a specific10q22 DNA probe; (c) contacting said probe with said test sample; and (d) analyzing DNA from said test sample, whereby aberrations in the hybridization of said probe to said DNA is compared to wild type DNA, indicating the risk for the development, recurrence or metastasis of said cancers. More specifically the method identifies the risk of the recurrence or metastasis of cancers. In a further embodiment, the probe size is from 1000 to 2000 base pairs or larger, for detection in a region of about 200,000 base pairs. In an additional embodiment, a specific 3p21 probe can be used with the 10q22 DNA probe. The control probe is a chromosome 10 stable marker, or more specifically Centromere10 (CEP10).

In another embodiment, there is provided a method for predicting the progression or metastasis of non-small cell carcinoma and other carcinoma in a subject comprising: (a) obtaining a test sample from a subject; (b) providing a RPL14, CD39L3, PMGM, or GC20 gene probe; (c) contacting said probe with said test sample; and (d) analyzing DNA from said test sample.

In yet another embodiment, there is provided a method for predicting the progression or metastasis of non-small cell carcinoma in a subject comprising: (a) obtaining a lung test sample from a subject; (b) providing a specific10q22 DNA probe; (c) contacting said probe with said test sample; and (d) analyzing DNA from said test sample.

In a further embodiment, there is provided a method for the staging lung of cancer in a subject comprising determining the deletion distribution of the 3p21.3 region.

In one embodiment, there is provided a method of determining likelihood of relapse or a new primary for a cancer subject comprising determining genetic aberrations at chromosomal loci 3p21.3 or 10q22 in DNA of bronchial tissue adjacent to tumor tissue from said subject, wherein abnormalities in DNA of said adjacent tissue correlate with relapse of said cancer. The cancer can comprise lung cancer or more specifically non-small cell carcinoma, adenocarcinoma, or squamous cell carcinoma. A specific gene probe may comprise RPL 14, CD39L3, PMGM, or GC20, or a 10q22 DNA probe. The 10q22 probe lies adjacent to the PTEN gene which is frequently involved non-small cell cancer. Both the 3p and the 10q probe can be used simultaneously. The test sample can be chosen from the same or contralateral lung, and can consist of tumorous or nontumorous bronchial cells.

In yet another embodiment, there is provided a method of identifying an individual to be segregated from a high risk environment comprising: (a) obtaining a test sample from a subject; (b) providing a gene probe containing RPL14, CD39L3, PMGM, and GC20 genes and PTEN or a 10q22 DNA probe, (c) contacting said probe with said test sample; and (d) analyzing DNA from said test sample, whereby said analysis is used to identify an individual who is highly susceptible to the development of lung cancer and who should not be exposed to a high risk environment.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent application contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.

The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein:

FIG. 1—3p Relapse Status. 3p21 and 10q22 deletion rates in adjacent bronchial epithelial cells of patients with benign lung disease, patients who developed stage 1 non-small cell cancer that did not relapse, and patients with stage 1 non-small cell cancer with relapse compared to 95% Cl N3P. Open squares indicate that the subjects are smokers, closed circles indicate that the subjects do not smoke (p<0.001).

FIG. 2—10q Relapse Status. 10q22 deletion rates in adjacent bronchial epithelial cells of patients with benign lung disease, patients who developed stage 1 non-small cell cancer that did not relapse, and patients with stage 1 non-small cell cancer with relapse compared to 95% C1 N10Q. Open squares indicate that the subjects are smokers, closed circles indicate that the subjects do not smoke (p<0.001).

FIG. 3—Lung Cancer Tissues. Diagram of tissue demonstrating histogenesis of lung cancer.

FIG. 4.—Percentage of Tumor cells in Dilutions. Chart showing the percentage of cells with 3p21.33 deletion detected by FISH relative to the concentration of a dilution sample.

FIG. 5—Normal Metaphase Cells. Microscope images where normal cells typically display 2 CEP3 (orange) signals and 2 3p21.33 (green) signals and tumor cells display 3 CEP3 (orange) signals and 2 3p21.33 (green) signals.

FIG. 6—Normal Interphase Cells (Lymphocytes). Microscope images where normal cells typically display 2 CEP3 (orange) signals and 2 3p21.33 (green) signals and tumor cells display 3 CEP3 (orange) signals and 2 3p21.33 (green) signals.

FIG. 7—Normal Bronchial Wash Cell. Microscope images where normal cells typically display 2 CEP3 (orange) signals and 2 3p21.33 (green) signals and tumor cells display 3 CEP3 (orange) signals and 2 3p21.33 (green) signals.

FIG. 8—Lung Cancer Cells. Microscope images where normal cells typically display 2 CEP3 (orange) signals and 2 3p21.33 (green) signals and tumor cells display 3 CEP3 (orange) signals and 2 3p21.33 (green) signals.

FIG. 9—Lung Cancer Cells. Microscope images where normal cells typically display 2 CEP3 (orange) signals and 2 3p21.33 (green) signals and tumor cells display 3 CEP3 (orange) signals and 2 3p21.33 (green) signals.

FIG. 10—10Q as a Predictor of Relapse. In a multivariate analysis, looking at the outcome in 96 patients, the deletion of 10Q in adjacent bronchial epithelial cells is a significant predictor of relapse.

FIG. 11—10Q as a Predictor of Long Term Survival. In a multivariate analysis, looking at the outcome in 96 patients, the deletion of 10Q in adjacent bronchial epithelial cells is a significant predictor of relapse long term survival.

FIG. 12—Interval for Patients who are Relapse Free. The proportion of patients who are relapse free from 0 to 108 months for patients who have a N10q value >5 and N10q<5.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS I. The Present Invention

As stated above, deletions in the 3p21.3 and 10q22 regions of human chromosomes 3 and the 10 have been shown to be associated with cancers. The present invention has shown these regions also to be predictive of the development of neoplasia and progression of neoplastic events. In particular embodiments, the inventors have developed novel DNA FISH probes and tested them on patients at M. D. Anderson Cancer Center (MDACC) in early stage non-small cell lung neoplasms using archival tissue from stage I non-small cell cancers.

The probes are used in the early detection of cancer and in chemopreventive studies as an intermediate biomarker. Until now there have been no reports about the application of these DNA probes to paraffin embedded clinical tumor specimens using fluorescence in situ hybridization (FISH) and microdissection. The FISH technique allows the measurement of an average level of deletion of a gene in a tumor, as well as the actual number and distribution of the gene in individual, morphological cells. The inventors propose that deletion distribution of the RPL14, CD39L3, PMGM or GC20 genes and 10q22 locus are useful as a diagnostic tool in determining the stage of lung cancer patients.

A. Smoking Related Cancers

The current invention is useful for the prognosis and diagnosis of lung cancers, which can be defined by a number of histologic classifications including: squamous cell carcinomas such as squamous carcinoma; small cell carcinomas such as oat cell carcinoma, intermediate cell type carcinoma, combined oat and cell carcinoma; adenocarcinomas such as acinar adenocarcinoma, papillary adenocarcinoma, bronchioloalveolar carcinoma, and solid carcinoma with mucus formation; large cell carcinoma such as giant cell carcinoma and clear cell carcinoma; adenosquamous carcinoma; carcinoid; and bronchial gland carcinomas such as adenoid cystic, and mucoepidermoid carcinoma. Diagnosis and prognosis of other smoking related cancers is possible with these probes. Squamous cell carcinoma of the head and neck has the same risk factors as lung cancer is hypothesized to have similar etiology (Shriver, 1998). Similarly, smoking is an etiological factor for cancer of the bladder, head, neck, kidneys, pancreas, and cancer of the upper airways including cancer of the mouth, throat, pharynx, larynx, or esophagus.

B. Tumorgenesis

The deletions of various genes in tumor tissue has been well studied in the art. However, there remains a need for probes that are significant for detecting early molecular events in the development of cancers, as well as molecular events that make patients susceptible to the development of cancer. Probes used for the staging of cancer are also of interest. The proposed sequence leading to tumorigenesis includes genetic instability at the cellular or submicroscopic level as demonstrated by loss or gain of chromosomes, leading to a hyperproliferative state due to theoretical acquisition of factors that confer a selective proliferative advantage. Further, at the genetic level, loss of function of cell cycle inhibitors and tumor suppressor genes (TSG), or amplification of oncogenes that drive cell proliferation, are implicated.

Following hyperplasia, a sequence of progressive degrees of dysplasia, carcinoma-in-situ and ultimately tumor invasion is recognized on histology. These histologic changes are both preceded and paralleled by a progressive accumulation of genetic damage. At the chromosomal level genetic instability is manifested by a loss or gain of chromosomes, as well as structural chromosomal changes such as translocation and inversions of chromosomes with evolution of marker chromosomes. In addition cells may undergo polyploidization. Single or multiple clones of neoplastic cells may evolve characterized in many cases by aneuploid cell populations. These can be quantitated by measuring the DNA content or ploidy relative to normal cells of the patient by techniques such as flow cytometry or image analysis.

C. Prognostic Factors and Staging

At present, the most important prognostic factor regarding the survival of patients with lung cancer of non-small cell type is the stage of disease at diagnosis. Small cell cancer usually presents with wide spread dissemination hence the staging system is less applicable. The staging system was devised based on the anatomic extent of cancer and is now know as the TNM system based on anatomical size and spread within the lung and adjacent structures, regional lymph nodes and distant metastases. The only hope presently for a curative procedure lies n the operability of the tumor which can only be resected when the disease is at a low sage, that is confined to the lung.

Occult Carcinoma TX NO MO Occult carcinoma with bronchopulmonary secretions containing malignant cells but without other evidence of the primary tumor or evidence of metastasis Stage 1 TIS NO MO Carcinoma in situ T1 NO MO Tumor that can be classified T1 without any metastasis to the regional lymph nodes T1 N1 MO Tumor that can be classified T1 with metastasis to the lymph nodes in the ipsilateral hilar region only T2 N1 MO Tumor that can be classified T2 without any metastasis to nodes or distant metastasis Stage II T2 N1 MO Tumor classified as T2 with metastasis to the lymph nodes in the ipsilateral hilar region only Stage III T3 with an N or M Any tumor more extensive than T2 N2 with an T or M Any tumor with metastasis to the lymph nodes in the mediastinum M1 with any T or N Any tumor with distant metastasis

D. Grading of Tumors

The histological type and grade of lung cancer do have some prognostic impact within the stage of disease with the best prognosis being reported for stage I adenocarcinoma, with 5 year survival at 50% and 1-year survival at 65% and 59% for the bronchiolar-alveolar and papillary subtypes (Naruke et al., 1988; Travis et al., 1995; Carriaga et al., 1995). For squamous cell carcinoma and large cell carcinoma the 5 year survival is around 35%. Small cell cancer has the worst prognosis with a 5 year survival rate of only 12% for patients with localized disease (Carcy et al., 1980; Hirsh, 1983; Vallmer et al., 1985). For patients with distant metastases survival at 5 years is only 1-2% regardless of histological subtype (Naruke et al., 1988). In addition to histological subtype, it has been shown that histological grading of carcinomas within subtype is of prognostic value with well differentiated tumors having a longer overall survival than poorly differentiated neoplasms. Well differentiated localized adencarcinoma has a 69% overall survival compared to a survival rate of only 34% of patients with poorly differentiated adenocarcinoma (Hirsh, 1983). The 5 year survival rates of patients with localized squamous carcinoma have varied from 37% for well differentiated neoplasms to 25% for poorly differentiated squamous carcinomas (Ihde, 1991).

The histologic criteria for subtyping lung tumors is as follows: squamous cell carcinoma consists of a tumor with keratin formation, keratin pearl formation, and/or intercellular bridges. Adenocarcinomas consist of a tumor with definitive gland formation or mucin production in a solid tumor. Small cell carcinoma consists of a tumor composed of small cells with oval or fusiform nuclei, stippled chromatin, and indistinct nuclei. Large cell undifferentiated carcinoma consists of a tumor composed of large cells with vesicular nuclei and prominent nucleoli with no evidence of squamous or glandular differentiation. Poorly differentiated carcinoma includes tumors containing areas of both squamous and glandular differentiation.

E. Development of Carcinomas

The evolution of carcinoma of the lung is most likely representative of a field cancerization effect as a result of the entire aero-digestive system being subjected to a prolonged period of carcinogenic insults such as benzylpyrenes, asbestosis, air pollution and chemicals other carcinogenic substances in cigarette smoke or other environmental carcinogens. This concept was first proposed by Slaughter et al. (1953). Evidence for existence of a field effect is the common occurrence of multiple synchronous for metachronous second primary tumors (SPTs) that may develop throughout the aero-digestive tract in the oropharynx, upper esophagus or ipsilateral or contralateral lung.

Accompanying these molecular defects is the frequent manifestation of histologically abnormal epithelial changes including hyperplasia, metaplasia, dysplasia, and carcinoma-in-situ. It has been demonstrated in smokers that both the adjacent normal bronchial epithelium as well as the preneoplastic histological lesions may contain clones of genetically altered cells. (Wistuba et al., 2000).

Liciardello et al. (1989) found a 10-40% incidence of metachronous tumors and a 9-14% incidence of synchronous SPTs in the upper and lower aero-digestive tract, mostly in patients with the earliest primary tumors SPTs may impose a higher risk than relapse from the original primary tumor and may prove to be the major threat to long term survival following successful therapy for early stage primary head, neck or lung tumors. Hence it is vitally important to follow these patients carefully for evidence of new SPTs in at risk sites for new malignancies specifically in the aero-digestive system.

In addition to chromosomal changes at the microscopic level, multiple blind bronchial biopsies may demonstrate various degrees of intraepithelial neoplasia at loci adjacent to the areas of lung cancer. Other investigators have shown that there are epithelial changes ranging from loss of cilia and basal cell hyperplasia to CIS in most light and heavy smokers and all lungs that have been surgically resected for cancer. (Auerbach et al., 1961). Voravud et al. (1993) demonstrated by in-situ hybridization (ISH) studies using chromosome-specific probes for chromosomes 7 and 17 that 30-40% of histologically normal epithelium adjacent to tumor showed polysomies for these chromosomes. In addition there was a progressive increase in frequency of polysomies in the tissue closest to the carcinoma as compared to normal control oral epithelium from patients without evidence of carcinoma. The findings of genotypic abnormalities that increased closer to the area of the tumor support the concept of field cancerization. Interestingly there was no increase in DNA content as measured in the normal appearing mucosa in a Feulgen stained section adjacent to the one where the chromosomes were measured, reflecting perhaps that insufficient DNA had been gained in order to alter the DNA index. Interestingly a very similar increase in DNA content was noted both in dysplastic areas close to the cancer and in the cancerous areas suggesting that complex karyotypic abnormalities that are clonal have already been established in dysplastic epithelium adjacent to lung cancer. Others have also shown an increase in number of cells showing p53 mutations in dysplastic lesions closest to areas of cancer, which are invariably also p53 mutated. Other chromosomal abnormalities that have recently been demonstrated in tumors and dysplastic epithelium of smokers includes deletions of 3p, 17p, 9 p and 5q (Feder et al., 1998; Yanagisawa et al., 1996; Thiberville et al., 1995).

F. Chromosome Deletions in Lung Cancer

Small cell lung cancer (SCLC) and non-small cell lung cancer commonly display cytogenetically visible deletions on the short arm of chromosome 3 (Hirano et al., 1994; Valdivieso et al., 1994; Cheon et al., 1993; Pence et al., 1993). This 3p deletion occurs more frequently in the lung tumor tissues of patients who smoke than it does in those of nonsmoking patient. (Rice et al., 1993) Since approximately 85% lung cancer patients were heavy cigarette smokers (Mrkve et al., 1993), 3p might contain specific DNA loci related to the exposure of tobacco carcinogens. It also has been reported that 3p deletion occurs in the early stages of lung carcinogenesis, such as bronchial dysplasia (Pantel et al., 1993). In addition to cytogenetic visible deletions, loss of heterozygosity (LOH) studies have defined 3-21.3 as one of the distinct regions that undergo loss either singly or in combination (Fontanini et al., 1992; Liewald et al., 1992). Several other groups have found large homozygous deletions at 3p21.3 in lung cancer (Macchiarini et al., 1992; Miyamoto et al., 1991; Ichinose et al., 1991; Yamaoka et al., 1990). Transfer of DNA fragments from 3-21.3-3p21.2 into lung tumor cell lines could suppress the tumorigenesis. (Sahin et al., 1990; Volm et al., 1989). These finding strongly suggest the presence of at least one tumor suppressor gene in this specific chromosome region whose loss will initiate lung carcinogenesis.

Cytogenetic observation of lung cancer has shown an unusual consistency in the deletion rate of chromosome 3p. In fact, small cell lung cancer (SCLC) demonstrates a 100% deletion rate within certain regions of chromosome 3p. Non small cell lung cancer (NSCLC) demonstrates a 70% deletion rate (Mitsudomi et al., 1996; Shiseki et al., 1996). Loss of heterozygosity and comparative genomic hybridization analysis have shown deletions between 3p14.2 and 3p21.3 to be the most common finding for lung carcinoma and is postulated to be the most crucial change in lung tumorigenesis (Wu et al., 1998). It has been hypothesized that band 3p21.3 is the location for lung cancer tumor suppressor genes. The hypothesis is supported by chromosome 3 transfer studies, which reduced tumorigenicity in lung adenocarcinoma.

Allelotype studies on non-small cell lung carcinoma indicated loss of genetic material on chromosome 10q in 27% of cases. Studies of chromosome 10 allelic loss have shown that there is a very high incidence of LOH in small cell lung cancer, up to 91%. (Alberola et 41995; Ayabe et al., 1994). A statistically significant LOH of alleles on 10q was noted in metastatic squamous cell carcinoma (SCC) in 56% of cases compared to non-metastatic SCC with LOH seen in only 14% of cases. (Ayabe et al., 1994). No LOH was seen in other subtypes on NSCLC. Peterson (1995) used paired samples of tumor and normal tissue to assess LOB. By micro-satellite polymorphism analysis, a high incidence of loss was found between D10s677 and D10S1223. This region spans the long arm of chromosome 10 at bands q21-q24 and overlaps the region deleted in the a study of advanced stage high grade bladder cancers which demonstrated a high frequency of allele loss within a 2.5cM region at 10q22.3-10q23.1 (Kim et al., 1996). II. The 3p21.3 Gene Probes

A. Structural Features

Recently, the human ribosomal L14 (RPL14) gene (GenBank Accession NM 003973, SEQ ID NO: 1), and the genes CD39L3 (GenBank Accession AAC39884 and AF039917; SEQ ID NO: 3), PMGM (GenBank Accession P15259 and J05073; SEQ ID NO: 5), and GC20 (GenBank Accession NM_(—)005875; SEQ ID NO: 7) were isolated from a BAC (GenBank Accession AC019204, herein incorporated by reference) and located in the 3p21.3 band within the smallest region of deletion overlap of various lung tumors. The RPL14 gene sequence contains a highly polymorphic trinucleotide (CTG) repeat array, which encodes a variable length polyalanine tract. Polyalanine tracts are found in gene products of developmental significance that bind DNA or regulate transcription. For example, Drosophila proteins Engraled, Kruppel and Even-Skipped all contain polyalanine tracts that act as transcriptional repressors. Genotype analysis of RPL14 shows that this locus is 68% heterozygous in the normal population, compared with 25% in NSCLC cell lines. Cell cultures derived from normal bronchial epithelium show a 65% level of heterozygosity, reflecting that of the normal population.

B. Functional Aspects

Genes with a regulatory function such as the RPL14 gene (SEQ ID NO: 1), along with the genes CD39L3, PMGM, and GC20 (SEQ ID NOS: 3, 5 and 7) and analogs thereof, are good candidates for diagnosis of tumorigenic events. It has been postulated that functional changes of the RPL14 protein (SEQ ID NO: 2) can occur via a DNA deletion mechanism of the trinucleotide repeat encoding for the protein. This deletion mechanism makes the RPL14 gene and attractive sequence that may be used as a marker for the study of lung cancer risk (Shriver et al., 1998). In addition, the RPL14 gene shows significant differences in allele frequency distribution in ethnically defined populations, making this sequence a useful marker for the study of ethnicity adjusting lung cancer (Shriver et al., 1998). Therefore, this gene is useful in the early detection of lung cancer, and in chemopreventive studies as an intermediate biomarker.

III. The 10q22 Gene Probes

A. Structural Features

The 10q22 BAC (46b12) is 200 Kb and is adjacent and centromeric to PTEN/MMAC1 (GenBank Accession AF067844), which is at 10q22-23 and can be purchased through Research Genetics (Huntsville, Ala.). Alterations to 10q22-25 has been associated with multiple tumors, including lung, prostate, renal, and endomentrial carcinomas, melanoma, and meningiomas, suggesting the possible suppressive locus affecting several cancers in this region. The PTEN/MMAC1 gene, encoding a dual-specificity phosphatase, is located in this region, and has been isolated as a tumor suppressor gene that is altered in several types of human tumors including brain, bladder, breast and prostate cancers. PTEN/MMAC1 mutations have been found in some cancer cell lines, xenografts, and hormone refractory cancer tissue specimens. Because the inventor's 10q22 BAC DNA sequence is adjacent to this region, the DNA sequences in the BAC 10q22 may be involved in the genesis and/or progression of human lung cancer.

B. Functional Aspects

Functional evidence for the presence of tumor suppressor genes on 10q has been provided by microcell-mediated chromosomal transfer. The resulting hybrid clones displayed a suppressed tumorigenic phenotype with the inability to proliferate in nude mice and soft agarose. Sequence analysis of the PTEN/MMAC1 gene in lung cancer revealed a G to C substitution located 8 by upstream of the coding region of exon1 and which seems to be a polymorphism, in 4 of the 30 cases of lung cancer tested. Somatic mutations of the TPEN/MMAC1 gene were not identified in any of the tumors at the primary and metastatic sites of lung cancer, indicating that point mutations in the PTEN/MMAC1 gene are probably not an important factor in tumorigenesis and the progression of a major subset of lung cancers. Other more important tumor suppressor genes must lie close to the PTEN/MMAC1 gene, in the vicinity of the inventors' 10q22 BAC locus. Therefor, the 10q22 probe is useful in the further development of clinical biomarkers for the early detection of neoplastic events, for risk assessment and monitoring the efficacy of chemoprevention therapy in high risk former or current smokers.

IV. Nucleic Acids

The inventors' have identified the probes for the human chromosome region 3p21.3 and human chromosome region 10q22. In addition, it should be clear that the present invention is not limited to the specific nucleic acids disclosed herein.

A. Probes and Primers

Naturally, the present invention encompasses DNA segments that are complementary, or essentially complementary, to target sequences. Nucleic acid sequences that are “complementary” are those that are capable of base-pairing according to the standard Watson-Crick complementary rules. As used herein, the term “complementary sequences” means nucleic acid sequences that are substantially complementary, as may be assessed by the same nucleotide comparison set forth above, or as defined as being capable of hybridizing to a target nucleic acid segment under relatively stringent conditions such as those described herein. These probes may span hundreds or thousands of base pairs.

Alternatively, the hybridizing segments may be shorter oligonucleotides. Sequences of 17 bases long should occur only once in the human genome and, therefore, suffice to specify a unique target sequence. Although shorter oligomers are easier to make and increase in vivo accessibility, numerous other factors are involved in determining the specificity of hybridization. Both binding affinity and sequence specificity of an oligonucleotide to its complementary target increases with increasing length. It is contemplated that exemplary oligonucleotides of about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 250, 500, 700, 722, 900, 992, 1000, 1500, 2000, 2500, 2800, 3000, 3500, 3800, 4000, 5000 or more base pairs will be used, although others are contemplated. As mentioned above, longer polynucleotides encoding 7000, 10000. 12000 bases and longer are contemplated as well. Such oligonucleotides will find use, for example, as probes in FISH, Southern and Northern blots and as primers in amplification reactions.

It will be understood that this invention is not limited to the particular probes disclosed herein and particularly is intended to encompass at least nucleic acid sequences that are hybridizable to the disclosed sequences or are functional sequence analogs of these sequences. For example, a partial sequence may be used to identify a structurally-related gene or the full length genomic or cDNA clone from which it is derived. Those of skill in the art are well aware of the methods for generating cDNA and genomic libraries which can be used as a target for the above-described probes (Sambrook et al., 1989).

For applications in which the nucleic acid segments of the present invention are incorporated into vectors, such as plasmids, cosmids or viruses, these segments may be combined with other DNA sequences, such as promoters, polyadenylation signals, restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is contemplated that a nucleic acid fragment of almost any length may be employed, with the total length preferably being limited by the ease of preparation and use in the intended recombinant DNA protocol.

DNA segments encoding a specific gene may be introduced into recombinant host cells and employed for expressing a specific structural or regulatory protein. Alternatively, through the application of genetic engineering techniques, subportions or derivatives of selected genes may be employed. Upstream regions containing regulatory regions such as promoter regions may be isolated and subsequently employed for expression of the selected gene.

B. Labeling of Probes

In certain embodiments, it will be advantageous to employ nucleic acid sequences of the present invention in combination with an appropriate means, such as a label, for determining hybridization. A wide variety of appropriate indicator means are known in the art, including fluorescent, radioactive, chemiluminescent, electroluminescent, enzymatic tag or other ligands, such as avidin/biotin, antibodies, affinity labels, etc., which are capable of being detected. In preferred embodiments, one may desire to employ a fluorescent label such as digoxigenin, spectrum orange, fluorosein, eosin, an acridine dye, a rhodamine, Alexa 350, Alexa 430, AMCA, BODIPY 630/650, BODIPY 650/665, BODIPY-FL, BODIPY-R6G, BODIPY-TMR, BODIPY-TRX, cascade blue, Cy2, Cy3, Cy5,6-FAM, HEX, 6-JOE, Oregon green 488, Oregon green 500, Oregon green 514, pacific blue, REG, ROX, TAMRA, TET, or Texas red.

In the case of enzyme tags such as urease alkaline phosphatase or peroxidase, colorimetric indicator substrates are known which can be employed to provide a detection means visible to the human eye or spectrophotometrically, to identify specific hybridization with complementary nucleic acid-containing samples. Examples of affinity labels include but are not limited to the following: an antibody, an antibody fragment, a receptor protein, a hormone, biotin, DNP, or any polypeptide/protein molecule that binds to an affinity label and may be used for separation of the amplified gene.

The indicator means may be attached directly to the probe, or it may be attached through antigen bonding. In preferred embodiments, digoxigenin is attached to the probe before denaturization and a fluorophore labeled anti-digoxigenin FAB fragment is added after hybridization.

C. Hybridization Conditions

Suitable hybridization conditions will be well known to those of skill in the art. Conditions may be rendered less stringent by increasing salt concentration and decreasing temperature. For example, a medium stringency condition could be provided by about 0.1 to 0.25 M NaCl at temperatures of about 37° C. to about 55° C., while a low stringency condition could be provided by about 0.15 M to about 0.9 M salt, at temperatures ranging from about 20° C. to about 55° C. Thus, hybridization conditions can be readily manipulated, and thus will generally be a method of choice depending on the desired results.

In other embodiments, hybridization may be achieved under conditions of, for example, 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl₂, 10 mM dithiothreitol, at temperatures between approximately 20° C. to about 37° C. Other hybridization conditions utilized could include approximately 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 μM MgCl₂, at temperatures ranging from approximately 40° C. to about 72° C. Formamide and SDS also may be used to alter the hybridization conditions.

V. Biomarkers

Various biomarkers of prognostic significance can be used in conjunction with the 3p21.3 or the 10q22 nucleic acid probes. These biomarkers could aid in predicting the survival in low stage cancers and the progression from preneoplastic lesions to invasive lung cancer. These markers can include proliferation activity as measured by Ki-67 (MIB1), angiogenesis as quantitated by expression of VEGF and microvessels using CD34, oncogene expression as measured by erb B2, and loss of tumor suppresser genes as measured by p53 expression.

Multiple biomarker candidates have been implicated in the evolution of neoplastic lung lesions. Bio-markers that have been studies include general genomic markers including chromosomal alterations, specific genomic markers such as alterations in proto-oncogenes such as K-Ras, Erbβ1/EGFR, Cyclin D; proliferation markers such as Ki67 or PCNA, squamous differentiation markers, and nuclear retinoid receptors (Papadimitrakopoulou et al., 1996) The latter are particularly interesting as they may be modulated by specific chemopreventive drugs such as 13-cis-retinoic acid or 4HPR and culminate in apoptosis of the defective cells with restoration of a normally differentiated mucosa (Zou et al., 1998).

A. Tumor Angiogenesis by Microvessel Counts

Tumor angiogenesis can be quantitated by microvessel density and is a viable prognostic factor in stage 1 NSCLC. Tumor microvessel density appears to be a good predictor of survival in stage 1 NSCLC.

B. Vascular Endothelial Growth Factor (VEGF)

VEGF (3, 6-8 ch 4) an endothelial cell specific mitogen is an important regulator of tumor angiogenesis who's expression correlates well with lymph node metastases and is a good indirect indicator of tumor agniogenesis. VEGF in turn is upregulated by P53 protein accumulation in NSCLC.

C. p53

The role of p53 mutations in predicting progression and survival of patients with NSCLC is widely debated. Although few studies imply a negligible role, the majority of the studies provide compelling evidence regarding the role of p53 as one of the prognostic factors in NSCLC. The important role of p53 in the biology of NSCLC has been the basis for adenovirus mediated p53 gene transfer in patients with advanced NSCLC (Carcy et al., 1980). In addition p53 has also been shown to be an independent predictor of chemotherapy response in NSCLC. In a recent study (Valimer et al., 1985), the importance of p53 accumulation in preinvasive bronchial lesions from patients with lung cancer and those who did not progress to cancer were studied. It was demonstrated that p53 accumulation in preneoplastic lesions had a higher rate of progression to invasion than did p53 negative lesions.

D. c-erb-B2

Similar to p53, c-erg-B2 (Her2/neu) expression has also been shown to be a good marker of metastatic propensity and an indicator of survival in these tumors.

E. Ki-67 Proliferation Marker

In addition to the above markers, tumor proliferation index as measured by the extent of labeling of tumor cells for Ki-67, a nuclear antigen expressed throughout cell cycle correlates significantly with clinical outcome in Stage 1 NSCLC (Feinstein et al., 1970). The higher the tumor proliferation index the poorer is the disease free survival labeling indices provides significant complementary, if not independent prognostic information in Stage 1 NSCLC, and helps in the identification of a subset of patients with Stage 1 NSCLC who may need more aggressive therapy.

VI. Prognosis and Diagnosis of Cancers Using 3p21.3 and 10q22 Gene Probes

Alterations in the 3p21.3 and 10q22 loci are known to be associated with a number of cancers. More specifically, point mutations, deletions, insertions or regulatory perturbations relating to the 3p21.3 and 10q22 loci may cause cancer or promote cancer development, cause or promoter tumor progression at a primary site, and/or cause or promote metastasis. Other phenomena at the 3p21.3 and 10q22 loci include angiogenesis and tissue invasion. Thus, the present inventors have demonstrated that deletions at 3p21.3 and 10q22 can be used not only as a diagnostic or prognostic indicator of cancer, but to predict specific events in cancer development, progression and therapy.

A variety of different assays are contemplated in this regard, including but not limited to, fluorescent in situ hybridization (FISH), direct DNA sequencing, PFGE analysis, Southern or Northern blotting, single-stranded conformation analysis (SSCA), RNase protection assay, allele-specific oligonucleotide (ASO), dot blot analysis, denaturing gradient gel electrophoresis, RFLP and PCR-SSCP.

Various types of defects are to be identified. Thus, “alterations” should be read as including deletions, insertions, point mutations and duplications. Point mutations result in stop codons, frameshift mutations or amino acid substitutions. Somatic mutations are those occurring in non-germline tissues. Germ-line tissue can occur in any tissue and are inherited.

A. Samples

One embodiment of the instant invention comprises a method for detecting variation in the hybridization of the probes to DNA. This may comprise determining specific alterations in the expressed product, or may simply involve detecting gross structural abnormalities. Such cancer may involve cancers of the lung, upper airway primary or secondary cancer, bladder, urithial, head and neck, esophagus, kidney, pancreas, mouth, throat, pharynx, larynx, brain, liver, spleen, small intestine, blood cells, lymph node, colon, breast, endometrium, stomach, prostate, testicle, ovary, skin, bone marrow, blood or other tissue.

In particular, the present invention relates to the diagnosis and prognosis of smoking related cancers. More particularly, the present invention relates to the diagnosis and prognosis of lung cancer which includes, but is not limited to: squamous cell carcinomas such as squamous carcinoma; small cell carcinomas such as oat cell carcinoma, intermediate cell type carcinoma, combined oat and cell carcinoma; adenocarcinomas such as acinar adenocarcinoma, papillary adenocarcinoma, bronchioloalveolar carcinoma, and solid carcinoma with mucus formation; large cell carcinoma such as giant cell carcinoma and clear cell carcinoma; adenosquamous carcinoma; carcinoid; and bronchial gland carcinomas such as adenoid cystic, and mucoepidermoid carcinoma.

The biological sample can be any tissue or fluid that contains nucleic acids. Various embodiments include paraffin imbedded tissue, frozen tissue, surgical fine needle aspirations, cells of the skin, muscle, lung, head and neck, esophagus, kidney, pancreas, mouth, throat, pharynx, larynx, esophagus, facia, brain, prostate, breast, endometrium, small intestine, blood cells, liver, testes, ovaries, colon, skin, stomach, spleen, lymph node, bone marrow or kidney. Other embodiments include fluid samples such as bronchial brushes, bronchial washes, bronchial lavages, peripheral blood lymphocytes, lymph fluid, ascites, serous fluid, pleural effusion, sputum, cerebrospinal fluid, lacrimal fluid, esophageal washes, stool or urinary specimens such as bladder washing and urine.

Bronchial washes sample more area of bronchial epithelium but are also frequently cytologically normal. A more complete sampling of the respiratory passages may occur with a bronchiolar alveolar lavage in which both left and right proximal and distal small bronchi and bronchioles are washed out.

Nucleic acids are isolated from cells contained in the biological sample, according to standard methodologies (Sambrook et al., 1989). The nucleic acid may be genomic DNA or fractionated or whole cell RNA. Where RNA is used, it may be desired to convert the RNA to a complementary DNA.

Depending on the format, the specific nucleic acid of interest is identified in the sample directly using amplification or with a second, known nucleic acid following amplification. Next, the identified product is detected. The detection may involve indirect identification of the product via fluorescent label, chemiluminescence, radioactive scintigraphy of radiolabel or even via a system using electrical or thermal impulse signals (Affymax Technology; Bellus, 1994). Alternatively, the detection may be performed by visual means (e.g., ethidium bromide staining of a gel).

Following detection, one may compare the results seen in a given sample with a statistically significant reference group of samples from normal patients and patients that have or lack alterations in chromosome loci 3p21.3 or 10q22. In this way, it is possible to correlate the amount or kind of alterations detected with various clinical states.

B. Fluorescence In Situ Hybridization

Fluorescence in situ hybridization (FISH) can be used for molecular studies. FISH is used to detect highly specific DNA probes which have been hybridized to chromosomes using fluorescence microscopy. The DNA probe is labeled with fluorescent or non fluorescent molecules which are then detected by fluorescent antibodies. The probes bind to a specific region or regions on the target chromosome. The chromosomes are then stained using a contrasting color, and the cells are viewed using a fluorescence microscope.

Each FISH probe is specific to one region of a chromosome, and is labeled with fluorescent molecules throughout it's length. Each microscope slide contains many metaphases. Each metaphase consists of the complete set of chromosomes, one small segment of which each probe will seek out and bind itself to. The metaphase spread is useful to visualize specific chromosomes and the exact region to which the probe binds. The first step is to break apart (denature) the double strands of DNA in both the probe DNA and the chromosome DNA so they can bind to each other. This is done by heating the DNA in a solution of formamide at a high temperature (70-75° C.) Next, the probe is placed on the slide and the slide is placed in a 37° C. incubator overnight for the probe to hybridize with the target chromosome. Overnight, the probe DNA seeks out it's target sequence on the specific chromosome and binds to it. The strands then slowly reanneal. The slide is washed in a salt/detergent solution to remove any of the probe that did not bind to chromosomes and differently colored fluorescent dye is added to the slide to stain all of the chromosomes so that they may then be viewed using a fluorescent light microscope. Two, or more different probes labeled with different fluorescent tags can be mixed and used at the same time. The chromosomes are then stained with a third color for contrast. This gives a metaphase or interphase cell with three or more colors which can be used to detect different chromosomes at the same time, or to provide a control probe in case one of the other target sequences are deleted and a probe cannot bind to the chromosome. This technique allows, for example, the localization of genes and also the direct morphological detection of genetic defects.

The advantage of using FISH probes over microsatellite instability to test for loss of allelic heterozygosity is that the a) FISH is easily and rapidly performed on cells of interest and can be used on paraffin-embedded, or fresh or frozen tissue allowing the use of micro-dissection b) specific gene changes can be analyzed on a cell by cell basis in relationship to centomeric probes so that true homozygosity versus heterozygosity of a DNA sequence can be evaluated (use of PCR for microsatellite instability may permit amplification of surrounding normal DNA sequences from contamination by normal cells in a homozygously deleted region imparting a false positive impression that the allele of interest is not deleted) c) PCR cannot identify amplification of genes d) FISH using bacterial artificial chromosomes (BACs) permits easy detection and localization on specific chromosomes of genes of interest which have been isolated using specific primer pairs.

C. Template Dependent Amplification Methods

A number of template dependent processes are available to amplify the marker sequences present in a given template sample. One of the best known amplification methods is the polymerase chain reaction (referred to as PCR™) which is described in detail in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,800,159, and in Innis et al., 1990, each of which is incorporated herein by reference in its entirety.

Briefly, in PCR, two primer sequences are prepared that are complementary to regions on opposite complementary strands of the marker sequence. An excess of deoxynucleoside triphosphates are added to a reaction mixture along with a DNA polymerase, e.g., Taq polymerase. If the marker sequence is present in a sample, the primers will bind to the marker and the polymerase will cause the primers to be extended along the marker sequence by adding on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended primers will dissociate from the marker to form reaction products, excess primers will bind to the marker and to the reaction products and the process is repeated.

A reverse transcriptase PCR amplification procedure may be performed in order to quantify the amount of mRNA amplified. Methods of reverse transcribing RNA into cDNA are well known and described in Sambrook et al., 1989. Alternative methods for reverse transcription utilize thermostable, RNA-dependent DNA polymerases. These methods are described in WO 90/07641 filed Dec. 21, 1990. Polymerase chain reaction methodologies are well known in the art.

Another method for amplification is the ligase chain reaction (“LCR”), disclosed in EPO No. 320 308, incorporated herein by reference in its entirety. In LCR, two complementary probe pairs are prepared, and in the presence of the target sequence, each pair will bind to opposite complementary strands of the target such that they abut. In the presence of a ligase, the two probe pairs will link to form a single unit. By temperature cycling, as in PCR, bound ligated units dissociate from the target and then serve as “target sequences” for ligation of excess probe pairs. U.S. Pat. No. 4,883,750 describes a method similar to LCR for binding probe pairs to a target sequence.

Qbeta Replicase, described in PCT Application No. PCT/US87/00880, may also be used as still another amplification method in the present invention. In this method, a replicative sequence of RNA that has a region complementary to that of a target is added to a sample in the presence of an RNA polymerase. The polymerase will copy the replicative sequence that can then be detected.

An isothermal amplification method, in which restriction endonucleases and ligases are used to achieve the amplification of target molecules that contain nucleotide 5′[alpha-thio]-triphosphates in one strand of a restriction site may also be useful in the amplification of nucleic acids in the present invention, Walker et al., (1992).

Strand Displacement Amplification (SDA) is another method of carrying out isothermal amplification of nucleic acids, which involves multiple rounds of strand displacement and synthesis, i.e., nick translation. A similar method, called Repair Chain Reaction (RCR), involves annealing several probes throughout a region targeted for amplification, followed by a repair reaction in which only two of the four bases are present. The other two bases can be added as biotinylated derivatives for easy detection. A similar approach is used in SDA. Target specific sequences can also be detected using a cyclic probe reaction (CPR). In CPR, a probe having 3′ and 5′ sequences of non-specific DNA and a middle sequence of specific RNA is hybridized to DNA that is present in a sample. Upon hybridization, the reaction is treated with RNase H, and the products of the probe identified as distinctive products that are released after digestion. The original template is annealed to another cycling probe and the reaction is repeated.

Still another amplification methods described in GB Application No. 2 202 328, and in PCT Application No. PCT/US89/01025, each of which is incorporated herein by reference in its entirety, may be used in accordance with the present invention. In the former application, “modified” primers are used in a PCR-like, template- and enzyme-dependent synthesis. The primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an excess of labeled probes are added to a sample. In the presence of the target sequence, the probe binds and is cleaved catalytically. After cleavage, the target sequence is released intact to be bound by excess probe. Cleavage of the labeled probe signals the presence of the target sequence.

Other nucleic acid amplification procedures include transcription-based amplification systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR (Kwoh et al., 1989; Gingeras et al., PCT Application WO 88/10315, incorporated herein by reference in their entirety). In NASBA, the nucleic acids can be prepared for amplification by standard phenol/chloroform extraction, heat denaturation of a clinical sample, treatment with lysis buffer and minispin columns for isolation of DNA and RNA or guanidinium chloride extraction of RNA. These amplification techniques involve annealing a primer which has target specific sequences. Following polymerization, DNA/RNA hybrids are digested with RNase H while double stranded DNA molecules are heat denatured again. In either case the single stranded DNA is made fully double stranded by addition of second target specific primer, followed by polymerization. The double-stranded DNA molecules are then multiply transcribed by an RNA polymerase such as T7 or SP6. In an isothermal cyclic reaction, the RNA's are reverse transcribed into single stranded DNA, which is then converted to double stranded DNA, and then transcribed once again with an RNA polymerase such as T7 or SP6. The resulting products, whether truncated or complete, indicate target specific sequences.

Davey et al., EPO No. 329 822 (incorporated herein by reference in its entirety) disclose a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA (“ssRNA”), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with the present invention. The ssRNA is a template for a first primer oligonucleotide, which is elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then removed from the resulting DNA:RNA duplex by the action of ribonuclease H(RNase H, an RNase specific for RNA in duplex with either DNA or RNA). The resultant ssDNA is a template for a second primer, which also includes the sequences of an RNA polymerase promoter (exemplified by T7 RNA polymerase) 5′ to its homology to the template. This primer is then extended by DNA polymerase (exemplified by the large “Klenow” fragment of E. coli DNA polymerase I), resulting in a double-stranded DNA (“dsDNA”) molecule, having a sequence identical to that of the original RNA between the primers and having additionally, at one end, a promoter sequence. This promoter sequence can be used by the appropriate RNA polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle leading to very swift amplification. With proper choice of enzymes, this amplification can be done isothermally without addition of enzymes at each cycle. Because of the cyclical nature of this process, the starting sequence can be chosen to be in the form of either DNA or RNA.

Miller et al., PCT Application WO 89/06700 (incorporated herein by reference in its entirety) disclose a nucleic acid sequence amplification scheme based on the hybridization of a promoter/primer sequence to a target single-stranded DNA (“ssDNA”) followed by transcription of many RNA copies of the sequence. This scheme is not cyclic, i.e., new templates are not produced from the resultant RNA transcripts. Other amplification methods include “RACE” and “one-sided PCR” (Frohman, M. A., In: PCR PROTOCOLS: A GUIDE TO METHODS AND APPLICATIONS, Academic Press, N.Y., 1990; Ohara et al., 1989; each herein incorporated by reference in their entirety).

Methods based on ligation of two (or more) oligonucleotides in the presence of nucleic acid having the sequence of the resulting “di-oligonucleotide”, thereby amplifying the di-oligonucleotide, may also be used in the amplification step of the present invention. Wu et al., (1989), incorporated herein by reference in its entirety.

D. Southern/Northern Blotting

Blotting techniques are well known to those of skill in the art. Southern blotting involves the use of DNA as a target, whereas Northern blotting involves the use of RNA as a target. Each provide different types of information, although cDNA blotting is analogous, in many aspects, to blotting or RNA species.

Briefly, a probe is used to target a DNA or RNA species that has been immobilized on a suitable matrix, often a filter of nitrocellulose. The different species should be spatially separated to facilitate analysis. This often is accomplished by gel electrophoresis of nucleic acid species followed by “blotting” on to the filter.

Subsequently, the blotted target is incubated with a probe (usually labeled) under conditions that promote denaturation and rehybridization. Because the probe is designed to base pair with the target, the probe will binding a portion of the target sequence under renaturing conditions. Unbound probe is then removed, and detection is accomplished as described above.

E. Separation Methods

It normally is desirable, at one stage or another, to separate the amplification product from the template and the excess primer for the purpose of determining whether specific amplification has occurred. In one embodiment, amplification products are separated by agarose, agarose-acrylamide or polyacrylamide gel electrophoresis using standard methods. See Sambrook et al., 1989.

Alternatively, chromatographic techniques may be employed to effect separation. There are many kinds of chromatography which may be used in the present invention: adsorption, partition, ion-exchange and molecular sieve, and many specialized techniques for using them including column, paper, thin-layer and gas chromatography (Freifelder, 1982).

F. Detection Methods

Products may be visualized in order to confirm amplification of the marker sequences. One typical visualization method involves staining of a gel with ethidium bromide and visualization under UV light. Alternatively, if the amplification products are integrally labeled with radio- or fluorometrically-labeled nucleotides, the amplification products can then be exposed to x-ray film or visualized under the appropriate stimulating spectra, following separation.

In one embodiment, visualization is achieved indirectly. Following separation of amplification products, a labeled nucleic acid probe is brought into contact with the amplified marker sequence. The probe preferably is conjugated to a chromophore but may be radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such as an antibody or biotin, and the other member of the binding pair carries a detectable moiety.

In one embodiment, detection is by a labeled probe. The techniques involved are well known to those of skill in the art and can be found in many standard books on molecular protocols. See Sambrook et al., 1989. For example, chromophore or radiolabel probes or primers identify the target during or following amplification.

One example of the foregoing is described in U.S. Pat. No. 5,279,721, incorporated by reference herein, which discloses an apparatus and method for the automated electrophoresis and transfer of nucleic acids. The apparatus permits electrophoresis and blotting without external manipulation of the gel and is ideally suited to carrying out methods according to the present invention.

In addition, the amplification products described above may be subjected to sequence analysis to identify specific kinds of variations using standard sequence analysis techniques. Within certain methods, exhaustive analysis of genes is carried out by sequence analysis using primer sets designed for optimal sequencing (Pignon et al, 1994). The present invention provides methods by which any or all of these types of analyses may be used. Using the sequences disclosed herein, oligonucleotide primers may be designed to permit the amplification of sequences throughout the RPL14, CD39L3, PMGM, or GC20 gene probes that may then be analyzed by direct sequencing.

G. Kit Components

All the essential materials and reagents required for detecting and sequencing RPL14, CD39L3, PMGM, or GC20 genes and variants thereof may be assembled together in a kit. This generally will comprise preselected primers and probes. Also included may be enzymes suitable for amplifying nucleic acids including various polymerases (RT, Taq, Sequenase™ etc.), deoxynucleotides and buffers to provide the necessary reaction mixture for amplification. Such kits also generally will comprise, in suitable means, distinct containers for each individual reagent and enzyme as well as for each primer or probe.

H. Chip Technologies

Specifically contemplated by the present inventors are chip-based DNA technologies such as those described by Hacia et al. (1996) and Shoemaker et al. (1996). These techniques involve quantitative methods for analyzing large numbers of genes rapidly and accurately. By tagging genes with oligonucleotides or using fixed probe arrays, one can employ chip technology to segregate target molecules as high density arrays and screen these molecules using methods such as fluorescence, conductance, mass spectrometry, radiolabeling, optical scanning, or electrophoresis. See also Pease et al. (1994); Fodor et al. (1991).

Biologically active DNA probes may be directly or indirectly immobilized onto a surface to ensure optimal contact and maximum detection. When immobilized onto a substrate, the gene probes are stabilized and therefore may be used repetitively. In general terms, hybridization is performed on an immobilized nucleic acid target or a probe molecule is attached to a solid surface such as nitrocellulose, nylon membrane or glass. Numerous other matrix materials may be used, including reinforced nitrocellulose membrane, activated quartz, activated glass, polyvinylidene difluoride (PVDF) membrane, polystyrene substrates, polyacrylamide-based substrate, other polymers such as poly(vinyl chloride), poly(methyl methacrylate), poly(dimethyl siloxane), photopolymers (which contain photoreactive species such as nitrenes, carbenes and ketyl radicals capable of forming covalent links with target molecules (Saiki, et al., 1994).

Immobilization of the gene probes may be achieved by a variety of methods involving either non-covalent or covalent interactions between the immobilized DNA comprising an anchorable moiety and an anchor. DNA is commonly bound to glass by first silanizing the glass surface, then activating with carbodimide or glutaraldehyde. Alternative procedures may use reagents such as 3-glycidoxypropyltrimethoxysilane (GOP) or aminopropyltrimethoxysilane (APTS) with DNA linked via amino linkers incorporated either at the 3′ or 5′ end of the molecule during DNA synthesis. Gene probe may be bound directly to membranes using ultraviolet radiation. With nitrocellous membranes, the probes are spotted onto the membranes. A UV light source is used to irradiate the spots and induce cross-linking. An alternative method for cross-linking involves baking the spotted membranes at 80° C. for two hours in vacuum.

Immobilization can consist of the non-covalent coating of a solid phase with streptavidin or avidin and the subsequent immobilization of a biotinylated polynucleotide (Holmstrom, 1993). Precoating a polystyrene or glass solid phase with poly-L-Lys or poly L-Lys, Phe, followed by the covalent attachment of either amino- or sulfhydryl-modified polynucleotides using bifunctional crosslinking reagents (Running, 1990 and Newton, 1993) can also be used to immobilize the probe onto a surface.

Immobilization may also take place by the direct covalent attachment of short, 5′-phosphorylated primers to chemically modified polystyrene plates (“Covalink” plates, Nunc) Rasmussen, (1991). The covalent bond between the modified oligonucleotide and the solid phase surface is introduced by condensation with a water-soluble carbodiimide. This method facilitates a predominantly 5′-attachment of the oligonucleotides via their 5′-phosphates.

Nikiforov et al. (U.S. Pat. No. 5,610,287) describes a method of non-covalently immobilizing nucleic acid molecules in the presence of a salt or cationic detergent on a hydrophilic polystyrene solid support containing an —OH, —C═O or —COOH hydrophilic group or on a glass solid support. The support is contacted with a solution having a pH of about 6 to about 8 containing the synthetic nucleic acid and the cationic detergent or salt. The support containing the immobilized nucleic acid may be washed with an aqueous solution containing a non-ionic detergent without removing the attached molecules.

There are two common variants of chip-based DNA technologies involving DNA microarrays with known sequence identity. For one, a probe cDNA (500˜5,000 bases long) is immobilized to a solid surface such as glass using robot spotting and exposed to a set of targets either separately or in a mixture. This method, “traditionally” called DNA microarray, is widely considered as developed at Stanford University. A recent article by Ekins and Chu (1999) provides some relevant details. The other variant includes an array of oligonucleotide (20˜25-mer oligos) or peptide nucleic acid (PNA) probes is synthesized either in situ (on-chip) or by conventional synthesis followed by on-chip immobilization. The array is exposed to labeled sample DNA, hybridized, and the identity/abundance of complementary sequences are determined. This method, “historically” called DNA chips, was developed at Affymetrix, Inc., which sells its products under the GeneChip® trademark.

VII. Examples

The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.

Example 1 Lung Cancer Patients and the Correlation between RPL 14 Gene Deletion Percentage and Patient Survival

Tissue Samples Normal lung tissue and cancerous lung tissue were obtained from lung biopsies embedded in paraffin blocks. These paraffin embedded histologic tissue are from a clinically and pathologically well characterized group of patients with stage 1 lung cancers that underwent resection at M. D. Anderson Cancer Center (MDACC) and were obtained from MDACC cases on file. These retrospective samples were drawn so that the retrospective samples were as fresh as possible and still have at least 2 years follow-up. Many cases had 14 years of follow up. Demographic information for these patient groups include: age at the time of diagnosis, race, gender, dietary information, initial treatment, screening test with results, date of diagnosis and follow-up with status, other diagnosis information, tobacco history, alcohol history, other diagnoses associated with tobacco or alcohol use, and other drugs or treatments which might have a chemopreventative effect.

Cell Dissociation of Interphase Nuclei from Formalin Fixed Paraffin Embedded Blocks Punch biopsies of 30 histologically representative cancerous lung tissue and adjacent normal lung tissue were performed on paraffin embedded blocks and the resulting tissue sections were placed in 1.5 ml Eppendorf tubes. The same Punch biopsy procedure was performed on 10 controls originating from normal lung tissue with no trace of cancerous growth. A dewaxing/rehydration incubation protocol, with a 3 min centrifuge (12,000 r.p.m.) between each step was performed on the tissue blocks: Xylene (30 min), Xylene (10 min), 100% Ethanol (10 min), 95% Ethanol (10 min), 70% Ethanol (10 min), 50% Ethanol (10 min), H₂O (10 min), H₂O (10 min).

After the incubation, scissors were inserted into the 1.5 ml Eppendorf tubes and used to finely cut the tissue. This step is critical, as it mechanically removes cells from their connective tissue surroundings. Next, 1 ml Protease K solution was added to each Eppendorf and incubated at 37° C. for 2 hr, while vortexing every 20 min. After the 37° C. incubation, the tissue contents of the Eppendorfs were poured into nylon mesh covered 15 ml Eppendorf tubes. The 1.5 ml Eppendorf tubes were washed with PBS and poured again into their respective nylon mesh covered Eppendorfs, in order to minimize sample loss. The nylon mesh covered Eppendorfs were centrifuged at 750 r.p.m.×10 min and the supernatant was removed with a pipette. Depending on the size of the pellet, between 0.5-2 ml of PBS was added to dilute the cellular specimen.

Cytospin slides were prepared from the 15 ml Eppendorf tubes. The concentration of the 15 ml Eppendorfs were adjusted in accordance with microscopic analysis of the cytospin slides. If too many cells were present in the field of view, then the Eppendorf tubes were diluted subjectively with 0.1 ml aliquots of PBS. If too few were present in the field of view, then the Eppendorf tubes were re-centrifuged and an estimated amount of supernatant was removed. After appropriate concentration adjustments, all cytospin slides were placed in a FISH fixative solution (3 parts methanol: 1 part acetic acid) for 20 min. Slides were stored in a −20° C. freezer.

Growth of BAC and Isolation of RPL 14 Template. A colony was inoculated with a 10 ml culture containing 1.5 ml LB+12.5 μg/ml chloramphenicol. It was grown overnight at 37° C., while shaking at 200 r.p.m. The culture was transferred to a 1.5 ml mirofuge tube. The cells were pelleted at full speed in a microfuge for 30 sec and the supernatant was removed. The cell pellet was thoroughly resuspended in 100 μl chilled Solution I using a Pipetman. 200 μl of freshly prepared Solution II was added to the tubes and they were then placed on ice. Each tube was mixed 8-10 times via inversion and returned to the ice. The cells lysed and the solution grew clear and viscous. Next, 150 μl of Solution III was added. The tubes were mixed by inversion 8-10 times and returned to the ice. The addition of solution III caused the formation of a flocculent precipitate. The tubes were centrifuged for 6 min at room temperature at full speed in a microfuge. The supernatant was transferred to a new microfuge tube. Any visible debris that was transferred was removed with a toothpick or pipet tip. The DNA was precipitated by adding 1 ml room temperature 100% ethanol and centrifuged for 6 min at room temperature in a microfuge. The supernatant was carefully removed and the DNA pellet was washed briefly in 70% ethanol. The pellet was air dried briefly (approximately 10 min) before being dissolved in an appropriate volume of TE buffer.

Preparation of 3p21.3 DNA Probe. The genomic clone of the gene was isolated from the CITB human BAC (bacterial artificial chromosomes) DNA library pools (Research genetics, Huntsvill, Al0 using PCR technique with a specific 3p21.3 gene primer. Genomic DNA was isolated from this gene by growing the positive BAC clone and isolated gene RPL 14 genomic DNA using a Qiagen Plasmin Kit and following the manufactures directions. The gene DNA sequence was confirmed by using PCR with the same gene primer. Localization of the RPL14 gene on chromosome 3 was confirmed by using normal metaphase FISH. Digoxigen is added to the probed before denaturization of the slides, and follows the procedure in the Boehringer Mannheim Biochemicals kit. The 3p21.3 template DNA isolated from BAC clones was added to a cocktail of 36.5 ul distilled H₂O, 5 ul A4, 1 ul Digoxigenin-11-dUTP, 1 ul DNA polymerase-1, and 4.0 ul of 10× Enzyme mix. The final cocktail was incubated in a 15° C. water bath for 75 min. The enzymatic reaction was stopped via incubation of the cocktail in a 75° C. water bath for 15 min.

The efficiency of these probe depended on its size parameters. Using a 100 by DNA ladder marker and gel electrophoresis, the inventors could ensure that the 3p21.3 probe was between 200-1000 by size. The marker lane of the gel contained a 10 ul loaded sample: 1 ul 100 by DNA ladder, 2 ul loading buffer, 7 ul 1×TAE. The 3p21.3 lane of the gel contained a 10 ul loaded sample: 6 ul 3p21.3 DNA, 2 ul loading buffer, 7 ul 1×TAE. If the banding patterns of the gel showed the p4robe to be between 200-100 bp, then the 3p21.3 DAN probe would be ready for precipitation.

Precipitation of 3p21.3 DNA Probe. The probes are precipitated and bound with a fluorophore using a Nick Translation system (Life Technologies) following the specifications supplied by the manufacturer. 30 ul 3p21.3 DNA probe was added to 8.0 ul human Cot-I DNA, 1.0 ul placenta DNA, 3.9 ul NaAoc., and 86 ul 100% EtOH. The cocktail was vortexed and briefly centrifuged. It was stored in a −70° C. freezer for 15 min. Next, it was centrifuged in a temperature controlled chamber at 4° C. for 20 min×136,000 r.p.m. The resulting DNA pellet was air dried for 20 min and dissolved in 60 ul of hybridization buffer. Each slide prepared for FISH analysis requires 10 ul of hybridization buffer. To the hybridization buffer, 4 ul CEP 3 probe (spectrum orange) was added. The final solution was denatured in a 75° C. water bath for 5 min and then placed in a 37° C. water bath for 30 min.

FISH Method. Slides were pretreated in a series of 0.1 N HCI-0.2% Triton-X100 in 2×SSC (15 min RT¹, Vibra²) 2×SSC (2 min, RT, Vibra), 1×PBS (2 min, RT, Vibra), 1% Formaldehyde (2 min, RT, Vibra), 1% Formaldehyde (4 min RT, No Vibra), 1×PBS ((2 min, RT, Vibra), and 2×SSC ((2 min, RT, Vibra). Next, they were denatured in 70% formamide/2×SSC, pH 7.3 for 5 min in a 74° C. water bath. Trial and error showed that a temperature of 74° C. was critical for the production of quality slides for FISH. After denaturation, the slides were dehydrated in a cold alcohol series. They wee then subjected to a protease K digestion at 37° C. for 9 min and dehydrated again in a cold alcohol series. After air drying the slides, 10 ul of the 3p21.3 probe prepared in step 2.5 was applied to each slide. They were covered with glass cover slips, sealed with rubber cement, and allowed to hybridize overnight at 37° C.

Post-washing and Immunohistochemical Labeling. After overnight hybridization, post-hybridization washes occurred in three stages with two stages of antibody labeling in between the washes. The first wash consisted of three rinses in 50% Formamide/2×SSC at 45° C. for 10 min. each, two rinses in 2×SSC at 45° C. for 10 min. each, and one rinse 2×SSC at room temperature for 10 min. The slides were then blocked with 50 ul/slide 4×SSC+1% BSA blocking solution for 5 min. Afterwards, the primary antibody was diluted in a 1:20 ratio with the blocking solution and 50 ul was added per slide for 30 min in the dark. The slides were covered with paraffin cover slips to concentrate the blocking and antibody solutions over the cellular areas. The second wash consisted of one rinse in 4×SSC at RT for 10 min., one rinse in 4×SSC+1% Triton at RT for 10 min., one rinse in 4×SSC at RT for 10 min, and one rinse in PN at RT for 10 min. The slides wee then blocked again and labeled with the secondary antibody, which was prepared in a 1:100 ratio with the blocking solution. The labeling reaction was permitted to occur for 60 min. in the dark. The final wash consisted of three rinses in PN at RT for 10 min. each. Interphase cells wee counterstained with 1 ug/ml DAPI containing antifade solution. Ten microliters of DAPI counterstain were added to each slide.

Visualization and Scoring of FISH Signals. Hybridization sites were analyzed using Nikon microscopes equipped with appropriate filter sets for visualizing spectrum green and orange as well as DAPI counterstain. At 100 nuclei from each slide were scored using a triple filter. Each cell was scored individually for the number of RPL 14 signals (spectrum green) and the number of corresponding CEP3 (spectrum orange) signals. To avoid misinterpretation due to inefficient hybridization, cells were counted only if at least one bright CEP3 signal and one bright RPL14 signal were present to avoid false monosomies or deletions due to insufficient hybridization efficiency. Only non-overlapping, intact nuclei were scored. Split centromere signals were counted as one, and minor centromere signals wee disregarded. The inventors used Mantle Cell Lymphoma cells as a negative control.

TABLE 1 Lung Cancer Patients Deletion Rates of the 3p21.3 probe containing RPL14, CD39L3, PMGM, and GC20 gene and Patient Survival Deletion Percent Expired Adenocarcinoma Cases 1a  3% dead 1b 10% 2a  2% alive 2b 11% 3a  8% alive 3b 16% 4  50% dead 5   2% dead 6a 14% alive 6b 30% 7  44% dead Squamous Cases 8a  4% dead 8b 44% 9a  8% alive 9b 64% 10a   6% alive 10b   8% 11  58% dead 12a   6% alive 12b  10%

Discussion. Table 1 provides an organized view of 12 patents suffering form lung cancer. The patients were separated into two different groups: those with adenocarcinoma and those with squamous cell carcinoma. For example, 3b represents cells isolated from a bronchous tumor via punch biopsies of paraffin embedded tissue blocks. The partner number 3a represents cells isolated from the same paraffin block, but from a nontumorous bronchous. Next, two types of cell samples were isolated per patient. Using FISH techniques directed to the Centromere 3 and the RPL14 gene of dissociated cells, the inventors were able to determine the deletion rate of the RPL14 gene in all patients. Initial data shows a promising correlation between the deletion percentage and survival of a patient.

Example 2-Retrospective Study of Lung Cancer Using 3p21.3 Gene Probe and FISH Detection

From an initial population of 200 patients studied retrospectively with Stage I lung cancer (culled from >13,000 patient files 1987-1988) the inventors identified 100 patients who had relapsed or died within 5 years. Additionally the inventors obtained archival bronchial tissue from 100 patients with lung tissue removed for reasons other than cancer which formed the basis of the control group. A detailed demographic history including smoking status, occupational history and family history of cancer was obtained for each patient.

The RPL14 gene probe (located on 3p21.3). Specific primer was designed based on the gene sequence with Electronic-PCR software. The genomic clone of the gene was isolated from the CITB human BAC DNA library pools (Research genetics, Huntsville, Ala.) using PCR technique with this specific gene primer.

Isolation of genomic DNA of 3p21,3. Growth of the positive BAC clone and isolated gene RPL14 genomic DNA using Qiagen Plasmid Kit as instructed by the manufacture. The gene DNA sequence was confirmed by using PCR with the same gene primer. Localization of the RPL14 gene on chromosome 3 was confirmed by using normal metaphase FISH. Preparation of specific gene FISH probes were prepared using a Nick Translation System (Life Technologies) as instructed by the manufacturer. If the banding patterns of the gel showed the probe to be between 200-1000 bp, then the probe would be ready for precipitation.

The BAC clone that contained genomic sequences that have the highest frequency of deletion at 10q region in tumor cell lines was selected. The DNA of this BAC clone isolation and probes labeling procedure were performed as above.

Tissue samples. Punch biopsies of histologically representative cancerous lung tissue and adjacent normal or histologically abnormal bronchial epithelial tissues were performed on paraffin embedded blocks. Resulting tissue sections were digested to obtain cell dissociation of interphase nuclei according to the Hedley technique. A subset of cases had imprints obtained form tumor and adjacent bronchus. Cells were fixed in FISH fixative which is Carnoy's solution (methanol and acetic acid in a 3:1 ratio).

FISH Studies. Dual color FISH studies were performed with the Spectrum Orange centromeric probes for chromosome 3 (CEP 3) (Vysis) and digoxigenin labeled specific RLP 14 gene or the Spectrum Orange centromeric probes for chromosome 10 (CEP 10) (Vysis) and Digoxigenin labeled specific 10q22 probes. CEP 3 or 10 probes were used as control probes respectively.

Slides were denatured in 70% formamide at 73° C. for 5 min. A mix of probes were denatured for 5 min at 75° C. and then applied to slides. After overnight hybridization at 37° C., post-hybridization washing were as follows: 50% Formamdie/2×SSC at 45° C. for 5 min. Digoxigenin labeled specific gene or 10q22 probes are detected by FITC conjugated sheep antidigoxigenin.

Interphases were counterstained with DAPI or PI counterstaining antifade solution. Hybridization sites were analyzed using Nikon microscopes equipped with the appropriate filter sets for visualizing spectrum green or orange as well as nuclei Counterstain. At least 200 nuclei with signals from each probe were scored using a triple filter. Slides were analyzed only if 80% of the cells were interpretable in the field of view. Only non-overlapping, intact nuclei were scored. Split centromeric signals (distance between two signals was equal or less than 0.5 um) were counted as one, and minor centromeric signals were disregarded. Normal lymphocytes were used as external control Deletion (%) was defined percent of cells with fewer signals of specific probe than signals of CEP3 or CEP10 in 200 cells counted.

Results. To date, the inventors have examined numerous dissociated tumors with their adjacent bronchi as well as numerous controls. Based on the results using DNA probes from 3p and 10q, the inventors have shown that the probes most likely are detecting tumor suppressor genes that are lost early on in tumorigenesis, are associated with smoking and appear to predict for the development of non-small cell lung cancer as well as for its overall survival. In addition, the inventors have shown that non-smokers who develop lung cancer have much higher rates of deletions, higher even than smokers (FIG. 1-2) and that these results are statistically significant (p<0.001).

FIG. 1 and FIG. 2 show the 3p21 and 10q22 deletion rates in adjacent bronchial epithelial cells of patients with benign lung disease, patients who developed stage 1 non-small cell cancer that did not relapse, and patients with stage 1 non-small cell cancer with relapse. Note that patients who relapsed had a much higher level of deletions than those patients who did not relapse, regardless of smoking status.

3p and 10q deletions were frequently expressed in lung tumors and showed no correlation with relapse, however the presence of 3p and 10q abnormalities in adjacent bronchial tissue was strongly correlated with relapse (0.09, and 0.0279) and survival (p=0.0348). Therefore, the probes may be useful markers in smoking-related damaged epithelium for risk assessment and for monitoring the efficiency of chemopreventive regimes.

Example 3 Lung Cancer Susceptibility in Former Heavy Smokers

A subset of bronchial lavages from former heavy smokers who had quit for an average of 6 years with median pack year history of 46 years was studied for lung cancer susceptibility. The study patients have surveillance bronchoscopy followed by blind biopsies of main bronchi from both lungs. Following this, a bronchial wash was performed, and triaged. Even though all these patients had quit smoking (average 6 years previously) most showed significant deletions for the 3p21 or the 10q22 FISH probes in bronchial wash specimens, indicating that in genetically susceptible individuals molecular defects appear to be persistent and are not related to the number of pack years.

Example 4 Serial Dilution of Tumor Cells

To detect how low concentration of tumor cells in the bronchial washing sample can be detected and the actual number and distribution of the gene in individual morphological cells, a serial dilution was done for evaluating the sensitivity of the 3p21.33 FISH probe. This test was also for quality control purpose. Two cell lines were used for the serial dilution experiment. H-1792 lung adenocarcinoma cell line was obtained from ATCC, the cell line exhibited cytogentic abnormalities including trisome chromosome 3 and 3p21.33 deletion. By FISH analysis, the cell demonstrated that over 100% of the interphases had 3 signals of CEP3 in contrast to 2 signals of 3p21.33 with 3p21.33/CEP11 probes. The normal bronchial epithelial cell was derived from a normal individual, showing normal number and structure of chromosome 3. H-1792 cells were mixed with same number of normal epithelial to dilute H-1792 cells to 50%. A serials of dilution was performed to further dilute H-1792 cell to 25%, 12.5%, 6.3%, 3.1%, 1.6%, 0.8%. The slides were made by cytospin preparations and randomized before hybridization. After hybridization and post washing, the percentage of cells with deletion of 3p12.33 signals were counted and compared with the projected values, as shown in FIG. 4.

Results of the serial dilution experiment demonstrated that the dilution concentration was positively related to the percentage of 3p21.33 deletion cells detected by FISH. However, when tumor cell line were diluted to a concentration <3.1%, it was not possible to identify 3p21.33 deletion cells, suggesting that the sensitivity of the probe or the lowest concentration of positive cells detected by the probe in bronchial washings was 3.1%.

Example 5 Progression of GC20 Study

After narrowing down the critical gene region in 3p21, the novel gene SUI1/GC20 (SEQ ID NO: 7) was identified in the region. SUI1/GC20 is a homolog of the SUI1 gene, which is a superfamily consisting of a growing number of proteins; SUI1 is a 113 amino-acid polypeptide similar to the protein from various different species. Primarily, the SUI1 gene product was believed to be a monitor translational accuracy protein by recognition of the protein synthesis initiation codon. Recent studies demonstrated that the SUI1 protein has a role in the nonsense-mediated mRNA decay pathway, by which cells have evolved elaborate mechanisms to rid themselves of aberrant proteins and transcripts. Identification of a stress-inducible cDNA of SUI1 suggested that modulation of translation initiation occurs during cellular stress and may represent an important adaptive response to genotoxin (e.g., tobacco) as well as endoplasmic reticular stress. SUI1 was expressed in normal liver but not in liver carcinoma cells. Introduction of SUI1 into liver carcinoma cells inhibited cell growth in vitro and partially inhibited tumor formation in nude mice. It is rational to suggest, therefore, proteins of the SUIL family possess tumor-suppressing properties and may represent a primary event, rather than a consequence, of tumorigenesis. Furthermore, since deletion of 3p21.3 was found by others to be the earliest acquired genetic changes in the pathogenesis of lung cancer, inventors also found that SUI1/GC20 transcript was diminished in all lung cancer cell lines tested by reverse transcription-PCR(RT-PCR). Inventors have cloned the full-length cDNA of SUI1/GC20 into a constitutive (pcDNA3.1/GS) with the C-terminal V5 epitope and polyhistidine (6×His) tag (SEQ ID NO: 9). The first four nucleic acids of SEQ ID NO: 9, “cacc” were added to the insert before the ATG by including the sequence in the forward primer (SEQ ID NO: 11), in order to conform to the to the consensus Kozak sequence for optimal translation initiation. The reverse primer is given in SEQ ID NO: 10 The last 102 nucleic acids of SEQ ID NO: 9 is not part of the insert, but is derived from the vector pcDNA3.1/GS, and codes for the V5/6x His tag.

The non-small cell lung cancer cell line H1972 was transfected with the full-length cDNA of SUI1/GC20 (SEQ ID NO: 9), resulting in H-1972 pcDNA3.1/GC20 or with the vector pcDNA3.1/GS (resulting in H-1972 pcDNA3.1) as control. Protein expression of GC20 was detected in the H-1972 pcDNA3.1/GC20 cells, but not in control cells. The growth of H-1972 pcDNA3.1/GC20 cells in serum-containing medium was significantly slower than that of H-1972 or the vector-transfected control H-1972 pcDNA3.1.

Studying the molecular genetic mechanisms by which SUI1/GC20 is inactivated can be done to characterize the gene. Analyze of the function of the gene can also be used to demonstrate its ability to inhibit cell growth and suppress tumorgenicity. As results of these studies, there is a better understanding of the tumorigenesis of tobacco-related lung cancer and the clinical biomarkers useful for its early detection and risk assessment.

Example 6 FISH Studies on Bronchial Wash Specimens from Patients with Benign, Atypical and Malignant Cytology Using a 3p21.3 DNA Probe

Bronchial wash specimens were tested for deletions of 3p21.3 by FISH using the locus-specific probe for 3p21.3 together with a centromeric probe for chromosome 3 as control. Inventors tested patients with non-small cell bronchogenic carcinoma, and patients on a chemopreventive protocol who demonstrated by cytology metaplasia, reserve cell hyperplasia or no abnormality. Also, the presence of the 3p deletion was correlated with the number of pack-years of smoking or tobacco-use.

Negative Cytology 7 cases with negative cytology showed levels of deletions of 3p21.3 between 0%-13% (mean 7.2±0.05) when tested with the 3p21.3 probe. The highest level of deletion was associated with a 122.5 pack year history of smoking. Interestingly, this high level of deletion was noted in two specimens, 6 months apart, from the same patient, indicating that there is a consistent deletion that did not response to fenretinamide (or cis-retinoic acid) therapy that was used as a chemopreventive agent.

Atypical Cytology There were 6 cases with cytological evidence of either reserve cell hyperplasia or squamous metaplasia/atypical metaplasia. The highest level of deletion was noted in a 90-pack year smoker. The deletions ranged from 7% to 15% (mean=10.5%±0.036).

Carcinoma In the third category of patients with cytological evidence of carcinoma (2 squamous carcinoma, 1 adenocarcinoma), the mean percent deletion was 17±0.13 (range: 8%-23%).

Results These results showed that in patients without evidence of lung cancer/squamous atypia, who had a history of smoking, a deletion of 3p21.3 existed that roughly paralleled the number of pack years smoked indicating that this deletion may occur secondarily to exposure to tobacco smoke, and also may be an early event in neoplastic transformation. None of these patients have yet to evidence clinical or straight chest X-ray evidence of lung cancer, however, those with the highest levels of deletion may be at high risk to develop neoplasia.

In patients with atypia as manifested by squamous metaplasia or atypia, the level of deletion was higher than in the negative group, with the highest levels of deletion noted in patients with carcinoma.

The results also correspond with previous studies with 3p21.3 probe for chromosomal aberrations in microdissected lung carcinomas and adjacent “normal” bronchial cells. Genetic instability is a very early event in tumorigenesis and chromosomal numerical abnormalities are associated with smoking. 3p21.3 deletions occurred more frequently in the lung tumors and adjacent bronchi of the patients who smoked than in control lung tissue from patients who did not smoke. Smoking may cause molecular damage much earlier than the corresponding manifestation of neoplasia at a morphologic level. Smoking is a major etiologic factor for the development of lung cancer and based on the studies presented herein, the loss of 3p21.3 is an early event in the tumorigenesis of lung cancer.

The 3p21.3 probe will be a useful marker in monitoring smoking-related target epithelia to measure risk assessment and for monitoring the efficiency of chemo-prevention therapy in high-risk former or current smokers.

TABLE 2 Results of FISH Studies on Bronchial Wash Specimens from Patients with Benign, Atypical and Malignant Cytology Using a 3p21.3 DNA Probe. DATE SMOKE HX BW # MDA # Received (PackYears) DIAGNOSIS 3pFISH Benign 87 228891 May 11, 2000 67.5 No slide  0% 179 398860 Oct. 12, 2000 122.5 Negative 12% 212 398860 Jan. 12, 2001 122.5 Negative 13% 247 451844 May 1, 2001 45 Negative 12% 249 424531 May 4, 2001 44 Negative  5% 257 459858 Jun. 26, 2001 39 Negative  8% 258 458362 Jul. 12, 2001 26 Negative  1% Atypia 31 413570 Jan. 19, 2000 87.5 Metaplasia 12% 146 385669 Sep. 1, 2000 Non-Smoker Metaplasia  8% 243 475347 Apr. 26, 2001 38 Metaplasia 14% 244 474853 Apr. 26, 2001 30 Metaplasia  7% 246 475666 Apr. 27, 2001 90 Metaplasia 15% 256 429515 Jun. 8, 2001 75 reserve cell  7% hyperplasia Malignant 252 404860 May 8, 2001 60 Sq. CA 11% 127 358000 Aug. 3, 2000 Non-Smoker Sq. CA  8% 210 406098 Jan. 10, 2001 Non-Smoker Ad. CA 32

Example 7 Sensitivity of the 3p21.33 FISH Probe in Detecting Lung Cancer Cells in Bronchial Wash Specimens

It was hypothesized that deletions of 3p.21.33 may be detected early on in carcinogenesis, and may thus have the potential to predict a patient's predisposition towards developing either primary lung cancer or a relapse thereof. The purpose of this study was to explore the efficacy of the FISH test, specifically the sensitivity of the 3p21.33 probe, for determining 3p21.33 deletions in interphase cells from patients' bronchial samples with the aim of developing a method for determining genetic predisposition to lung cancer. The sensitivity of the outcome depended on the visibility of the 3p21.33 gene loci as well as the ability to detect deletions of the 3p21.33 locus in the malignant cell lines compared to the admixed normal bronchial cells.

Cell Samples and Slides The tumor cells in this study were obtained from an H-1792 lung adenocarcinoma cell line obtained from ATCC, which exhibited cytogenic abnormalities including trisomy of chromosome 3 and 3p21.33 deletion. These cells were separated and harvested from culture bottles and diluted from a concentration of 2.52×10⁶ cells per ml to 7.14×10⁵ cells per ml using PBS buffer. Normal bronchial epithelial cells, showing normal numbers and structure for chromosome 3, were acquired from the bronchial wash of a normal individual at a concentration of 7.14×10⁵ cells per ml. The cancer cell sample was diluted by the normal cells to concentrations of 0%, 0.8%, 1.6%, 3.13%, 6.25%, 12.5%, 25%, 50%, 75%, 87.5%, 93.75%, 96.8%, 98.4%, and 100% and transferred onto individual slides using a cytospin preparation.

Nick Translation The DNA probe used to identify the 3p21.33 gene was created using nick translation, a widely used method for its ease in controlling fragment size (Wilkinson, 1998) Digoxigenin enzyme, which is able to be detected with antibodies, was used to cut 3p DNA (Andreeff et al., 1999). The probe was tested using gel electrophoresis to ensure a length of 200-500 kilobase pairs. CEP3 probe (chromosome 3 centromere) was premixed and provided by a commercial company (Vysis, Downers Grove, Ill.).

FISH Method The 3p21.33 probe was precipitated by mixing with human cot-1 DNA (Vysis, Downers Grove, Ill.), human placenta DNA (Sigma, La Jolla, Calif.), NaOAcetate, and 100%-20 ethanol, incubated at −80° C. for 15 minutes, and centrifuged for 20 minutes at 4° C. The remaining pellet was dissolved in hybridization buffer (Vysis, Downers Grove, Ill.) at room temperature, at 10 μl per slide, and the centromeric probe was added. The probe was placed in a 75° C. water bath for 5 minutes and then transferred to a 37° C. water bath for 20 minutes. At the same time, the slides to be tested were placed into a 70% Formamide/SSC solution for 3 to 4 minutes to denature the DNA and subsequently placed into a series of cold ethanol jars in order to permeabilize the cells by removing the lipid membranes (Wilkinson, 1998). 10 μl of the probe was then pipetted onto each slide and put into a humidity box to incubate overnight at 37° C. The following day, the slides were first placed into 3 jars of 50% Formamide/2×SSC at 45° C. for 10 minutes each and then into a jar of 2×SSC for 10 minutes at 45° C. as well. The slides were blocked for five minutes using 4×SSC/BSA and then covered with the first antibody (anti-digoxigenin) and placed in a humidity box for an hour. This was followed by a series of TNT buffer washings. Once again the blocking procedure was performed before the second antibody was placed on the slides for another hour. The slides were washed again with TNT buffer. Lastly, 10 μl of DAPI was added to the slides to stain the nucleus of each cell.

Visualization and Counting of FISH Signals The hybridized slides were examined utilizing a Labophot-2 microscope (Nikon, Tokyo, Japan) under filters for visualizing green, orange, and DAPI fluorescence signals (FIGS. 5-9). One hundred cells were selected for analysis from each dilution, and were counted only if the entire nucleus was distinct and intact. To avoid misinterpretations owing to false monosomies or deletions due to insufficient hybridization, nuclei were counted only if at least one bright CEP3 signal (orange) and one bright 3p21.33 signal (green) were present. The numbers of orange signals versus green signals were counted and recorded individually for each chosen cell. A particular cell was deemed a normal epithelial cell if it had an equal number of green and orange signals, signifying that there was one 3p.21.33 gene per centromere on the chromosomes. In contrast, cells were identified as tumor cells if they possessed fewer green signals than orange signals, proving that there, indeed, was a deletion of the 3p21.33 gene within its genomic makeup.

Results and Discussion Typically, normal epithelial cells displayed 2 orange and 2 green signals, whereas tumor cells showed 3 orange and 2 green signals (FIGS. 5-9). Data indicates that the baseline sensitivity for FISH detection of deletions of 3p21.33 is 3.13% (see Example 4), this places high hopes for early detection of the development lung cancer, recurrence, or metastasis. With extended studies, the FISH probe for 3p21.33 can be established and widely used as a useful and reliable marker for assessing lung cancer and, perhaps, further utilizing and improving methods like fine needle aspiration in conjunction with the FISH method. This study also lays the baseline for future studies and for the monitoring of preneoplastic or neoplastic events and may be used as a surrogate intermediate biomarker in chemoprevention techniques in lung cancer.

Example 8 Statistical Analysis of 10Q Probe

FIG. 10 and FIG. 11 provide a predicted probability of relapse and long term survival for patients. Using data from 96 patients, the deletion of 10Q in bronchial epithelial cells adjacent to the tumor cells is compared with both relapse (FIG. 10 and long term survival (FIG. 11). 10Q deletion is a significant predictor of relapse.

In FIG. 12, the proportion of patients who are relapse free at times ranging from 0 to 108 months is shown. The data is divided into a set of patients who have a N10q value of greater than 5 and patients who have a N10q value of less than or equal to 5. While about 40% of the patients with N10q>5 are relapse free after a long interval (5-7 years), over 60% of the patients with N10q<5 are relapse free after the same time interval.

REFERENCES

The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference:

-   Wilkinson; D. G. In Situ Hybridization: A Practical Approach. New     York, Oxford: 1998. -   Andreeff, M. D., Ph.D., Michael, Pinkel, Ph.D., Daniel. Introduction     to Fluorescence in Situ Hybridization Principles and Clinical     Applications. New York, Wily-Liss: 1999. -   Alberola et al., Proc. Annu. Mt. Am. Soc. Clin. Oncol., 14: A1094,     1995. -   Auerbach et al., N. Engl J. Med., 265: 253-267, 1961. -   Ayabe et al., Lung Cancer, 11(3-4): 201-208, 1994. -   Barinaga, Science, 271: 1233, 1996. -   Brugal et al., Method. Achiev. Exp. Pathol., (Karger, Base1) 11:     1-33, 1984. -   Carcy et al., JNCI, 65: 1225-1230, 1980. -   Carriaga et al., Cancer, 75: 406-421, 1995. -   Cheon et al., Yonsei Med. J., 34(4): 365-370, 1993. -   Dalquen et al., Virchows Archiv., 431(3): 13-179, 1997. -   Dong et al., Science, 268: 884-886, 1995. -   Ekins, R.; Chu, F. W., Trends in Biotechnology, 17: 217-218, 1999. -   Fearon et al., Science, 247: 47-56, 1990. -   Feder et al., Cancer Genet. Cytogenet., 102: 25-31, 1998. -   Feinstein et al., Am. Rev. Repir. Dis., 101: 671-684, 1970. -   Field et al., Cancer Res. 59: 2690, 1999. -   Fodor et al., Science, 251:767-773, 1991. -   Fontanini et al., Cancer, 70(6): 1520-7, 1992. -   Frohman, M. A., In: PCR PROTOCOLS: A GUIDE TO METHODS AND     APPLICATIONS, Academic Press, N.Y., 1990 -   Hacia et al., Nature Genetics, 14:441-447, 1996. -   Hirano et al., American J. Path., 144(2): 296-302, 1994. -   Hirsh, Manksgaard, 1-78, 1983. -   Holmstrom et al., Anal. Biochem. 209:278-283, 1993. -   Hosoe et al., Lung Cancer, 10: 297, 1994. -   Ichinose et al., J. Surgical Oncology, 46(1): 15-20, 1991. -   Ihde, Curr. Prob. Cancer, 15: 65, 1991. -   Kim et al., Korean J. Intern Med., 11(2): 101-7, 1996. -   Kwoh et al., Proc. Nat. Acad. Sci. USA, 86: 1173, 1989. -   Licciardello et al., Int. J. Radiat. Oncol. Bio. Phys., 17: 467-476,     1989. -   Liewald et al., Chirurg, 63(3): 205-10, 1992. -   Liflon, Science, 272: 676, 1996. -   Macchiarini et al., Proc Annu Mt. Am. Soc. Clin. Oncol. 11: A995,     1992. -   Miki et al., Science 266: 66-71, 1994. -   Mitsudomi et al., Clin. Cancer Res., 2(7): 1185-9, 1996. -   Miyamoto et al., Cancer Research, 51(23pt1) 6346-50, 1991. -   Morahan et al., Science 272: 1811, 1996. -   Mrkve et al., Anticancer Research, 13(3): 571-8, 1993. -   Muguerza et al., World J. Surg. 21(3): 323-329, 1997. -   Naruke et al., J. Thorac. Cardiovas Surg, 96: 400, 1988. -   Newton, C. R, et al. Nucl. Acids Res. 21:1155-1162 (1993). -   Ohara et al., Proc. Nat'l Acad. Sci. USA, 86:5673-5677, 1989. -   Pantel et al., Proc. Annu Mt. Am. Soc. Clin. Oncol., 12: A941, 1993. -   Papadimitrakopoulou et al., Cancer and Metastasis Reviews, 15:     53-76, 1996. -   Pease et al., Proc. Natl. Acad. Sci. USA, 91:5022-5026, 1994. -   Pence et al., Archives of Surgery, 128(12): 1382-1390, 1993. -   Pignon et al., Hum. Mutat., 3: 126-132, 1994. -   R. Ekins and F. W. Chu, Trends in Biotechnology, 17: 217-218, 1999. -   Rasmussen, et al., Anal. Biochem, 198:138-142, 1991. -   Rice et al., J. Thoracic Cardio. Surgery, 106(2): 201-217, 1993. -   Running. J. A. et al., BioTechniques 8:276-277, 1990. -   Sahin et al., Cancer, 65(3): 530-7, 1990. -   Saiki et al., Science, 239: 487-491, 1988. -   Sambrook et al., (ed.), Molecular Cloning, Cold Spring Harbor     Laboratory Press, Cold Spring Harbor, N.Y., 1989. -   Satoh et al., Mol. Carcinog., 7: 157, 1993. -   Shiseki et al., Genes Chromosomes Cancer, 17(2): 71-7, 1996. -   Shoemaker et al., Nature Genetics 14:450-456, 1996. -   Shriver et al., Mutat. Res. 406(1): 9-23, 1998. -   Sidransky et al., Science, 252: 706-709, 1991. -   Siest et al., J. Cellul. Biochem., 28/29: 64, 1997. -   Slamon et al., Science, 244: 707-712, 1989. -   Taparowsky et al., Nature, 300: 762-764, 1982. -   Thiberville et al., Cancer Research, 55: 5133-5139, 1995. -   Thiberville et al., Int. J. Cancer, 64: 371, 1995b. -   Travis et al., Cancer, 75: 191-202, 1995. -   Valdivieso et al., Proc. Annu. Mt. Am. Soc. Clin. Oncol., 13: A1121,     1994. -   Vallmer et al., Hum. Pathol., 16: 247-252, 1985. -   VanOijen et al., Cancer Epidemiology, Biomarkesr & Prevention, 9:     249, 2000. -   Vo-Dinh, et al., Anal. Chem., 66: 3379-3383, 1994. -   Volm et al., Versicherungsmedizin, 41(1): 2-5, 1989. -   Voravud, et al., Cancer Research, 53: 2874-2883, 1993. -   Walker et al., Nucleic Acids Res. 20(7):1691-1696, 1992. -   Wistuba et al., Cancer Res., 60(7): 1949-60, 2000. -   Wu et al., Cancer Res., 58(8): 1605-8, 1998. -   Yamaoka et al., J. Japan Surgical Soc., 91(10): 1608-16, 1990. -   Yanagisawa et al., Cancer Research, 56: 5579-5582, 1996. -   Zou et al., Clinical Cancer Research 4: 1345-1355, 1998. 

1-88. (canceled)
 89. A method for identifying a subject having non-small cell lung cancer or at risk for the development, recurrence, or metastasis of non-small cell lung cancer comprising: (a) obtaining a test sample from a human subject; (b) providing a 3p21 DNA probe; (c) contacting the probe with the test sample; (d) analyzing DNA from the test sample, (e) detecting a loss of heterozygosity in the hybridization of the probe to the DNA, as compared to a centromeric DNA probe for chromosome 3; and (f) identifying the subject having non-small cell lung cancer or at risk for the development, recurrence, or metastasis of non-small cell lung cancer when loss of heterozygosity is detected.
 90. The method of claim 89, wherein the 3p21 DNA probe comprises a CD39L3, PMGM, and GC20 genes.
 91. The method of claim 90, wherein the 3p21 DNA probe further comprises a RPL 14 gene.
 92. The method of claim 89, wherein the test sample comprises a surgical or biopsy specimen, a paraffin embedded tissue, a frozen tissue imprint, a sputum, esophageal brush, a fine needle aspiration, a buccal smear or a bronchial lavage.
 93. The method of claim 89, wherein the subject is a smoker.
 94. The method of claim 89, wherein the subject is a former smoker.
 95. The method of claim 89, wherein the subject is a non-smoker.
 96. The method of claim 89, wherein the test sample comes from the subject who has not previously been diagnosed with cancer.
 97. The method of claim 89, wherein the probes are labeled with fluorophores.
 98. The method of claim 89, wherein one of the probes is labeled with digoxigenin.
 99. The method of claim 89, wherein size of the probes is between 1000 and 2000 base pairs.
 100. The method in claim 89, further comprising performing a spiral CT-scan.
 101. The method of claim 89, wherein the method is used to identify subjects who need an intensive follow-up protocol.
 102. The method of claim 89, wherein the probes are used to identify subjects who are suitable for novel investigational therapeutic approaches.
 103. The method of claim 30, wherein said centromeric probe is labeled with a fluorophore.
 104. The method of claim 103 wherein said centromeric probe is labeled with spectrum orange.
 105. The method of claim 30, wherein said centromeric probe is a chromosome 3 stable marker.
 106. The method of claim 105, wherein said centromeric probe is Centromere 3 (CEP 3).
 107. The method of claim 89, wherein analyzing comprises using FISH.
 108. The method of claim 89, wherein the probes are used as biomarkers for the early detection of early neoplastic events or cancer.
 109. A method for predicting the progression or metastasis of non-small cell carcinoma in a subject comprising: (a) obtaining a lung test sample from a human subject; (b) providing a 3p21 DNA probe; (c) contacting the probe with the test sample; (d) analyzing DNA from the test sample, (e) detecting a loss of heterozygosity in the hybridization of the probe to the DNA, as compared to a centromeric DNA probe for chromosome 3; and (0 predicting the development, recurrence, or metastasis of non-small cell carcinoma in the subject.
 110. The method of claim 109, wherein the 3p21 DNA probe comprises a CD39L3, PMGM, and GC20 genes.
 111. The method of claim 110, wherein the 3p21 DNA probe further comprises a RPL 14 gene. 